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NEW APPROACHES TO NUMERACY

half the time and fewer half the time. There are no strong deviations
from the overall pattern. In particular, Ruth's famous 60 home runs in
1927 do not stand out from the other values; it is Babe's best effort but
not unusual in the context of his career.

In contrast, Maris's record of 61 homers in 1961 is an outlier that
falls clearly outside his overall pattern. That overall pattern (excluding
the outlier) is again roughly symmetric and is centered at about 23.
The different locations of the two distributions show Ruth's general
superiority as a home-run hitter.

To see the overall pattern of the distribution of a single variable, we
learn to look for symmetry or skewness, for single or multiple peaks,
for the center and the degree of spread about the center. Important
deviations from a regular pattern include gaps and outliers. Notice that
while constructing the display is an operation to be learned, interpreta-
tion requires judgment.

No distribution of real data has the perfect mirror symmetry of some
mathematical shapes. Not all distributions are well described as either
symmetric or skewed. Too much emphasis on classifying what we see
will frustrate both teachers and students. Learn to observe marked fea-
tures, not to debate unclear features. Note also that looking at data
naturally leads to attempts to interpret what we see, as when we noticed
that Ruth's 60 was not an unusual performance for him, while Maris's
61 was an outstanding achievement far beyond his usual level.

Interpreting the overall shape of a distribution is an important part
of learning to look at data. The histogram in Figure 2 displays student-
collected data on the lengths of words in Popular Science magazine. The
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FIGURE 2. Student-collected data on the length of words in
Popular Science magazine reveal a skewed distribution since
shorter words are more common than longer ones.